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ANALYSIS OF CASINO SHELF SHUFFLING MACHINES 

PERSI DIACONIS, JASON FULMAN, AND SUSAN HOLMES 

Abstract. Many casinos routinely use mechanical card shuffling machines. We were 
asked to evaluate a new product, a shelf shuffler. This leads to new probability, new 
combinatorics, and to some practical advice which was adopted by the manufacturer. The 
interplay between theory, computing, and real-world application is developed. 

1. Introduction 



K- 5 We were contacted by a manufacturer of casino equipment to evaluate a new design for 

tH- a casino card-shuffling machine. The machine, already built, was a sophisticated "shelf 

T-H shuffler" consisting of an opaque box containing ten shelves. A deck of cards is dropped 

^_^ into the top of the box. An internal elevator moves the deck up and down within the 

^^ box. Cards are sequentially dealt from the bottom of the deck onto the shelves; shelves are 

r \ chosen uniformly at random at the command of a random number generator. Each card is 

randomly placed above or below previous cards on the shelf with probability 1/2. At the 
end, each shelf contains about 1/10 of the deck. The ten piles are now assembled into one 
pile, in random order. The manufacturer wanted to know if one pass through the machine 
^ S^ would yield a well-shuffled deck. 

Testing for randomness is a basic task of statistics. A standard approach is to design 
^-H some ad hoc tests such as: Where do the original top and bottom cards wind up? What 

^ is the distribution of cards that started out together? What is the distribution, after one 

\^ shuffle, of the relative order of groups of consecutive cards? Such tests had been carried 

Q^ out by the engineers who designed the machine, and seemed satisfactory. 

^N We find closed-form expressions for the probability of being at a given permutation after 

C^^ the shuffle. This gives exact expressions for various global distances to uniformity, e.g., 

^--^ total variation. These suggest that the machine has flaws. The engineers (and their bosses) 

,_^ needed further convincing; using our theory, we were able to show that a knowledgeable 

H" player could guess about 9 1/2 cards correctly in a single run through a 52-card deck. For 

. ^H a well-shuffled deck, the optimal strategy gets about 4 1/2 cards correct. This data did 

/\ convince the company. The theory also suggested a useful remedy. Journalist accounts of 



JS our shuffling adventures can be found in Klarreich (2003 2002); Mackenzie (2002). 



Section 2 gives background on casino shufflers, needed probability, and the literature of 



shuffling. Section 3 gives an analysis of a single shuffle; we give a closed formula for the 
chance that a deck of n cards passed through a machine with m shelves is in final order 
w. This is used to compute several classical distances to randomness. In particular it is 
shown that, for n cards, the /(oo) distance is asymptotic to e^'^'^'^ — 1 if the number of 
shelves m = crc'l'^ and n is large. The combinatorics of shelf shufflers turns out to have 
connections to the "peak algebra" of algebraic combinatorics. This allows nice formulae for 
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the distribution of several classical test statistics: the cycle structure (e.g., the number of 
fixed points), the descent structure, and the length of the longest increasing subsequence. 
Section 4] develops tools for analyzing repeated shelf shuffling. Section 5] develops our 



"how many can be correctly guessed" tests. This section also contains our final conclusions. 

2. Background 



This section gives background and a literature review. Section 2.1 treats shuffling ma- 



chines; Section 2.2 gives probability background; Section 2.3 gives an overview of related 



literature and results on the mathematics of shuffling cards. 

2.1. Card shuffling machines. Casinos worldwide routinely employ mechanical card- 
shuffling machines for games such as blackjack and poker. For example, for a single deck 
game, two decks are used. While the dealer is using the first deck in the usual way, the 
shuffling machine mixes the second deck. When the first deck is used up (or perhaps half- 
used), the second deck is brought into play and the first deck is inserted into the machine. 
Two-, four-, and six-deck machines of various designs are also in active use. 

The primary rationale seems to be that dealer shuffling takes time and use of a machine 
results in approximately 20% more hands per hour. The machines may also limit dealer 
cheating. 

The machines in use are sophisticated, precision devices, rented to the casino (with service 
contracts) for approximately $500 per month per machine. One company told us they had 
about 8,000 such machines in active use; this amounts to millions of dollars per year. The 
companies involved are substantial businesses, listed on the New York Stock Exchange. 

One widely used machine simulates an ordinary riffle shuffle by pushing two halves of a 
single deck together using mechanical pressure to make the halves interlace. The randomness 
comes from slight physical differences in alignment and pressure. In contrast, the shelf 
shufflers we analyze here use computer-generated pseudo-random numbers as a source of 
their randomness. 

The pressure shufflers require multiple passes (perhaps seven to ten) to adequately mix 
52 cards. Our manufacturer was keen to have a single pass through sufflce. 

2.2. Probability background. Let S'n denote the group of permutations of n objects. Let 
U{a) = 1/n! denote the uniform distribution on Sn- If P is a probability on Sn, the total 
variation, separation, and loo distances to uniformity are 

(2.1) 

11^ - f^llTV = IY1 I^H - ^Hl = max \P{A) - U{A)\ = J max \P{f) - U{f)\, 

P(w) 



P(w 
sep(P) = max ( 1 - ;' ) , ||P - U\\oo = max 

w \ U[W) / w 



1 



U{w) 



Note that \\P — U\\tv < sep(P) < ||P — C/||oo- The first two distances are less than 1; the 
II lloo norm can be as large as n! — 1. 

If one of these distances is suitably small then many test statistics evaluate to approx- 
imately the same thing under P and U. This gives an alternative to ad hoc tests. The 
methods developed below allow exact evaluation of these and many further distances (e.g., 
chi-square or entropy). 

Repeated shuffling is modeled by convolution: 

P * P{w) = ^ P{v)P{wv~^), P*''{w) = P * P*('=-i)(u;). 
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All of the shelf shufflers generate ergodic Markov chains (even if only one shelf is involved), 
and so P*^{w) — >• U{w) as A; — >■ oo. One question of interest is the quantitative measurement 
of this convergence using one of the metrics above. 

2.3. Previous work on shuffling. 

Early work. The careful analysis of repeated shuffles of a deck of cards has challenged 
probabilists for over a century. The first efforts were made by Markov (1906) in his papers 
on Markov chains. Later, Poincare (1912) studied the problem. These great mathematicians 
proved that in principle repeated shuffling would mix cards at an exponential rate but gave 
no examples or quantitative methods to get useful numbers in practical problems. 



Borel and Cheron (1955) studied riffle shuffling and concluded heuristically that about 



seven shuffles would be required to mix 52 cards. Emile Borel also reported joint work with 
Paul Levy, one of the great probabilists of the twentieth century; they posed some problems 
but were unable to make real progress. 

Isolated but serious work on shuffling was reported in a 1955 Bell Laboratories report by 
Edgar Gilbert. He used information theory to attack the problems and gave some tools for 
riffle shuffling developed jointly with Claude Shannon. 

They proposed what has come to be called the Gilbert-Shannon-Reeds model for riffle 
shuffling; this presaged much later work. Thorp ( 1973 ) proposed a less realistic model and 
showed how poor shuffling could be exploited in casino games. Thorp's model is analyzed 
in 



Morris (2009) 



Epstein (1977) reports practical studies of how casino dealers shuffle with 
data gathered with a very precise microphone! The upshot of this work was a well-posed 
mathematics problem and some heuristics; further early history appears in Chapter 4 of 



Diaconis (1988). 



The modern era. The modern era in quantitative analysis of shuffling begins with papers 



of Diaconis and Shahshahani (1981) and Aldous (1983). They introduced rigorous meth- 



ods, Fourier analysis on groups, and coupling. These gave sharp upper and lower bounds, 
suitably close, for real problems. In particular, Aldous sketched out a proof that | log2 n 
riffle shuffles mixed n cards. A more careful argument for riffle shuffling was presented by 
Aldous and Diaconis (1986). This introduced "strong stationary times," a powerful method 



of proof which has seen wide application. It is applied here in [Section 4 



A definitive analysis of riffle shuffling was finally carried out in Bayer and Diaconis ( 1992 ) 



and Diaconis et al. (1995). They were able to derive simple closed- form expressions for all 
quantities involved and do exact computations for n = 52 (or 32 or 104 or . . . ). This results 
in the "seven shuffles theorem" explained below. A clear elementary account of these ideas 
is in iMannl (119941 11995|) reprinted in iGrinstead and Snelll (|1997|). See lEthierl (poTol) for an 



informative textbook account. 

The successful analysis of shuffling led to a host of developments, the techniques refined 
and extended. For example, it is natural to want not only the order of the cards, but 
also the "up-down pattern" of one-way backs to be randomized. Highlights include work 



of Bidigare et al. ( 1999 ) and Brown and Diaconis ( 1998 ) who gave a geometric interpre- 



tation of shuffling which had many extensions to which the same analysis applied. L alley 
(1996, 1999) studied less random methods of riffle shuffling. Fulman ( 2000a|b 2001) showed 
that interspersing cuts doesn't materially effect things and gave high level explanations for 
miraculous accidents connecting shuffling and Lie theory. The work is active and ongoing. 



In recent work, Diaconis et al 



Recent surveys are given by Diaconis (1996, 2003); Fulman (1998); Stark et al. (2002). 



(1995); Conger and Viswanath (2006) and Assaf et al. 

( [2011 ) have studied the number of shuffles required to have selected features randomized 
(e.g., the original top card, or the values but not the suits). Here, fewer shuffles sufflce. 
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Conger and Howald (2010) shows that the way the cards are dealt out after shuffling affects 



things. The mathematics of shuffling is closely connected to modern algebraic combinatorics 
through quasi-symmetric functions ^Stanley , ( ,2001| ) . The descent theory underlying shuffling 
makes equivalent appearances in the basic task of carries when adding integers |Diaconis 
and Fulman ( 2009a|b >; Diaconis and Fulman (2011). 



3. Analysis of one pass through a shelf shuffler 
This section gives a fairly complete analysis of a single pass through a shelf shuffler. 



Section 3.1 gives several equivalent descriptions of the shuffle. In Section 3.2, a closed-form 
formula for the chance of any permutation w is given. This in turn depends only on the 
number of "valleys" in w. The number of permutations with j valleys is easily calculated 
and so exact computations for any of the distances above are available. Section 3.3| uses the 
exact formulae to get asymptotic rates of convergence for /(oo) and separation distances. 



Section 0| gives the distribution of such permutations by cycle type. [Section 3. 5| gives the 
distribution of the "shape" of such a permutation under the Robinson-Schensted-Knuth 
map. Section 3.6| gives the distribution of the number of descents. We find it surprising that 



a real-world applied problem makes novel contact with elegant combinatorics. In Section 4 



iterations of a shelf shuffler are shown to be equivalent to shelf shuffling with more shelves. 
Thus all of the formulae of this section apply. 

3.1. Alternative descriptions. Consider two basic shelf shufflers: for the first, a deck of 
n cards is sequentially distributed on one of m shelves. (Here, n = 52, m = 10, are possible 
choices.) Each time, the cards are taken from the bottom of the deck, a shelf is chosen at 
random from one to m, and the bottom card is placed on top of any previous cards on the 
shelf. At the end, the packets on the shelves are unloaded into a final deck of n. This may 
be done in order or at random; it turns out not to matter. Bayer and Diaconis ( 1992 ) called 
this an inverse m-shujfle. 

This shuffling scheme, that is the main object of study, is based on m shelves. At each 
stage that a card is placed on a shelf, the choice of whether to put it on the top or the 
bottom of the existing pile on that shelf is made at random (1/2 each side). This will be 
called a shelf shuffle. There are several equivalent descriptions of shelf shuffles: 

Description 1 (Shelf shuffles). A deck of cards is initially in order 1, 2, 3, . . . , n. Label the 
back of each card with n random numbers chosen at random between one and 2m. Remove 
all cards labeled 1 and place them on top, keeping them in the same relative order. Then 
remove all cards labeled 2 and place them under the cards labeled 1, reversing their relative 
order. This continues with the cards labeled 3, labeled 4, and so on, reversing the order in 
each even labeled packet. If at any stage there are no cards with a given label, this empty 
packet still counts in the alternating pattern. 
For example, a twelve-card deck with 2m = 4, 

Label 211433124 3 4 1 
Card 1 2 3 4 5 6 7 8 9 10 11 12 

is reordered as 

2 3 7 12 8 1 5 6 10 11 9 4. 

Description 2 (Inverse shelf shuffles). Cut a deck of n cards into 2m piles according to a 
multinomial distribution; thus the number of cards cut off in pile i has the same description 
as the number of balls in the ith box if n balls are dropped randomly into 2m, boxes. Reverse 
the order of the even-numbered packets. Finally, riffle shuffle the 2m packets together by 



the Gilbert-Shannon-Reeds (GSR) distribution Bayer and Diaconis ( |1992) dropping each 
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card sequentially with probability proportional to packet size. This makes all possible 
interleavings equally likely. 




1 

Figure 1. Two shelves in shelf shuffle. 



Description 3 (Geometric description). Consider the function fmi^) from [0, 1] to [0,1 



13 5 

2m ' 2m ' 2m ' ' 



2m-l 
2m 



Figure 1 



illustrates 



which has "tents," each of slope ±2771 centered at 
an example with m = 2. Place n labeled points uniformly at random into the unit interval. 
Label them, from left to right, xi,X2, • • ■ ,Xn- Applying fm gives yi = fm{xi). This gives 
the permutation 

1 2 ••• n 
TTi vr2 • • • 7r„ 

with TTi the relative position from the bottom of yi , . . . , ttj the relative position from the 
bottom of yi among the other yj. This permutation has the distribution of an inverse shelf 
shuffle. It is important to note that the natural distances to uniformity (total variation, 
separation, loo) are the same for inverse shuffles and forward shuffles. In Section 4, this 



description is used to show that repeated shelf shuffling results in a shelf shuffle with more 
shelves. 

3.2. Formula for the chance of a permutation produced by a shelf shuffler. To 

describe the main result, we call i a valley of the permutation t(;GS'„ifl<i<n and 
w{i — 1) > w{i) < w{i + 1). Thus w = 5136724 has two valleys. The number of valleys is 



classically used as a test of randomness for time series. See Warren and Seneta (1996) and 



their references. If v{n, k) denotes the number of permutations on n symbols with k valleys 

then Warren and Seneta (1996) f;(l,0) = 1, v{n,k) 

l,k 



{2k + 2)v{n -l,k) + {n- 2k)v{n 



1). So v{n,k) is easy to compute for numbers of practical interest. Asymptotics are 
in Warren and Seneta] ( |1996| ) which also shows the close connections between valleys and 
descents. 

Theorem 3.1. The chance that a shelf shuffler with m shelves and n cards outputs a 
permutation w is 

''n + m — a — 1\ fn — 1 — 2v{wy 
a — v{w) 



^v{w)+l '"-I 



\n Z_^ 



2(2m)" -^-^ V n 

^ ' a=0 ^ 

where v{yS) is the number of valleys ofw. This can be seen to be the coefficient of t^ in 

1 (l + t)n+l/ 4t \v{w)+l 



2(2m)"(l-t; 



n+l 



(l+t)2 
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Example. Suppose that m = 1. Then the theorem yields the uniform distribution on the 
2""^ permutations with no vaheys. Permutations with no valleys are also sometimes called 
unimodal permutations. These arise in social choice theory through Coombs' "unfolding" 



hypothesis (Diaconis 1988, Chap. 6). 



Remark. By considering the cases m>n and n > mwe see that, in the formula of Theorem 

1 instead of m 



3.1 , the range of summation can be taken up to n 
later. 



1. This will be useful 



Theorem 3.1 makes it easy to compute the distance to stationarity for any of the metrics 



m 



Section 2.2 Indeed, the separation and l{oo) distance is attained at either permutations 
with a maximum number of valleys (when n = 52, this maximum is 25) or for permutations 
with valleys. For the total variation distance, with Pm{v) denoting the probability in 
Theorem |3.1[ 



U\ 



TV 



1 

- Y^ v{n,a) 



a=0 



Pm{a) 



1 

n! 



Table 1 . Distances for various numbers of shelves 



m 


10 


15 


20 


25 


30 


35 


50 


100 


150 


200 


250 


300 


\\Pm - C^IItV 


1 


.943 


.720 


.544 


.391 


.299 


.159 


.041 


.018 


.010 


.007 


.005 


sep(Pm) 


1 


1 


1 


1 


1 


.996 


.910 


.431 


.219 


.130 


.085 


.060 


\\Pm-UU 


oo 


oo 


oo 


45118 


3961 


716 


39 


1.9 


.615 


.313 


.192 


.130 



Table 1 gives these distances when n = 52 for various numbers of shelves m. Larger values 



of m are of interest because of the convolution results explained in Section 4} These numbers 
show that ten shelves are woefully insufficient. Indeed, 50 shelves are hardly sufficient. 



To prove Theorem 3.1 , we will relate it to the following 27ra-shuffie on the hyperoctahedral 
group Bn'- cut the deck multinomially into 2m piles. Then ffip over the odd numbered stacks, 
and riffie the piles together, by dropping one card at a time from one of the stacks (at each 
stage with probability proportional to stack size). When m = 1 this shuffie was studied in 
Bayer and Diaconis (|1992>, and for larger m it was studied in Fulman (2001). 



It will be helpful to have a description of the inverse of this 27Ti-shuffle. To each of the 
numbers {1, . . . , n} is assigned independently and uniformly at random one of —1, 1, —2, 2, 
. . . , — m, m. Then a signed permutation is formed by starting with numbers mapped to —1 
(in decreasing order and with negative signs), continuing with the numbers mapped to 1 
(in increasing order and with positive signs), then continuing to the numbers mapped to 
—2 (in decreasing order and with negative signs), and so on. For example the assignment 

{1, 3, 8} ^ -1, {5} ^ 1, {2, 7} ^ 2, {6} ^ -3, {4} ^ 3 

leads to the signed permutation 

-8 -3 -1527 -6 4. 



The proof of Theorem |3.1| depends on an interesting relation with shuffles for signed 
permutations (hyperoctahedral group). This is given next followed by the proof of Theorem 

EH 



Theorem 3.2 gives a formula for the probability for w after a hyperoctahedral 2m-shuffle, 
when one forgets signs. Here p{w) is the number of peaks of w, where i is said to be a peak 
of li; if 1 < i < n and w{i — 1) < w{i) > w{i + 1). Also K{'w) denotes the peak set of w 
and D{w) denotes the descent set of w (i.e., the set of points i such that w{i) > w{i + 1)). 
Finally, let [n] = { 1 , . . . , n} . 
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Theorem 3.2. The chance of a permutation w obtained by performing a 2m shuffle on the 
hyperoctahedral group and then forgetting signs is 



^{w i)+i -^ ^n + m-a-l\(n-l- 2p{w 



2{2m)"- ^-^ \ n J\ a — p{w ^) 

where p{w~^) is the number of peaks ofw^^. 

Proof. Let P'{m) denote the set of nonzero integers of absolute value at most m, totally 
ordered so that 

-l^l^-2^2^---^ -m -< m. 



Then given a permutation w = (it;i,--- ,Wn)^ page 768 of Stembridge (1997) defines a 
quantity A(u'). (Stembridge calls it A(u;,7), but throughout we always choose 7 to be the 
identity map on [n], and so suppress the symbol 7 whenever he uses it). By definition, 
/S.{w) enumerates the number of maps / : [n] 1— t- P'{m) such that 

• f{wi)<---<f{Wn) 

• f{wi) = f{wi+i) >0 ^i^ D{w) 

• f{wi) = f{wi+i) < ^ i G D{w) 

We claim that the number of maps / : [n] 1— )• P'{m) with the above three properties is 
equal to (2m)" multiplied by the chance that a hyperoctahedral 2m-shuffle results in the 
permutation w~^. This is most clearly explained by example: 

w = 8 3 1527 64 
/ = -1 -1 -1 1 2 2 -3 3 

lYiova. the inverse description of the hyperoctahedral 2m-shuffle stated before the proof, 
the assignment yields w. This proves the claim. 

Let A.{w) denote the set of peaks of w. From Proposition 3.5 of Stembridge (1997), 

A(u>) = 2P('")+i Y^ Le. 

EC[n-l]:A(w)CEA{E+l) 

Here 

Le= Y1 1' 

l<ii<---<i„<m 
keE^ik<ik+l 

and A denotes symmetric difference, i.e., AAB = (A — B) U {B — A). Now a simple 
combinatorial argument shows that Le = (" „ )• Indeed, Le is equal to the number 
of integral ii, . . . ,in with 1 < «i • • • < in ^ m — \E\, which by a "stars and bars" argument 

is (n+m-\E\-ly ^j^^g 

A(^) = 2^H+i J2 fn + m-\E\-l\ 

ECln-l]:A{w)CEA{E+l) ^ ^ 

Now let us count the number of E of size a appearing in this sum. For each j E ^(w), 
exactly one of j or j — 1 must belong to E, and the remaining n — 1 — 2p{w) elements of 
[n — 1] can be independently and arbitrarily included in E. Thus the number of sets E of 
size o appearing in the sum is 2p("''("~_~,^1"''). Hence 

^Piw)+i '^/n + rn-a-l\/n-l- 2p(w)\ 

"°*°^Sv » Jv "-Pi-) )' 

which completes the proof. D 
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Proof of Theorem 3.1. To deduce Theorem 3.1 from Theorem 3.2 , it is not hard to see that a 
shelf shuffle with m shelves is equivalent to taking w' to be the inverse of a permutation after 
a hyperoctahedral 2?TT,-shuffle, then taking a permutation w defined by w{i) = n — w'{i) + 1. 
Thus the shelf shuffle formula is obtained from the hyperoctahedral 27n-shufHe formula by 
replacing peaks by valleys. D 

Remarks. 



The paper Fulman (2001) gives an explicit formula for the chance of a signed per- 



mutation after a 277i-shuffle on Bn in terms of cyclic descents. Namely it shows this 
probability to be 

/m+n—cd{w~^)\ 

(2m)" 
where cd{w) is the number of cyclic descents of w, defined as follows: Ordering the 
integers l<2<3<---<---<-3<-2< -1, 

— w has a cyclic descent at position i for l<i<n — lif w{i) > w{i + 1). 

— w has a cyclic descent at position n if w{n) < 0. 

— w has a cyclic descent at position 1 if w{l) > 0. 

For example the permutation 3 1 — 2 4 5 has two cyclic descents at position 1 and 
a cyclic descent at position 3, so cd{w) = 3. 

This allows one to study aspects of shelf shufflers by lifting the problem to Bn, 
using cyclic descents (where calculations are often easier), then forgetting about 
signs. This idea was used in Fulman (2001 ) to study the cycle structure of unimodal 



permutations, and in Aguiar et al. (2004) to study peak algebras of types B and D. 



The appearance of peaks in the study of shelf shufflers is interesting, as peak algebras 
have appeared in various parts of mathematics. Nyman (2003) proves that the 



peak algebra is a subalgebra of the symmetric group algebra, and connections with 



geometry of polytopes can be found in Aguiar et al. (2006) and Billera et al. (2003). 



There are also close connections with the theory of P-partitions Petersen (2005 



2007); Stembridge (1997). 



The following corollary shows that for a shelf shuffler of n cards with m shelves, the 
chance of a permutation w with v valleys is monotone decreasing in v. Thus, the identity 
(or any other unimodal permutation) is most likely and an alternating permutation (down, 
up, down, up, . . . ) is least likely. From Theorem 3.1 the chance of a fixed permutation 
with V valleys is 



(3.1) 



i«+i 



PM 



n-l 

E 

a=0 



n + m — 1 



n 



\-2v 



2(2m)" ^-i V n 

Corollary 3.3. For P{v) defined at ([33]), P{v) > P{v + l 

Proof. Canceling common terms, and setting a — v = j 

have '-^P{v) = E]=o~'''fiJ + v)r\~'l = 2--^-^- E{f{Sn-i-2v + v)) with /(a 
-1- 



( „ ) and Sn 
equivalent to 

(3.2) E (/ {Sn 

To prove this, represent S, 



2v distributed as Binomial (n 



0<v<{n- l)/2. 

j + v) in (3.1), we 



so a 



2v, 2)- The proposed inequality is 



l-2i) 



2v +V))>E if {Sn-l-2v-2 + V + 1)) . 

= Sn-i-2v-2 + ^1 + ^2i with Yi independent taking values 



in {0,1}, with probability 1/2. Then (3.2) is equivalent to 
(3.3) 

Yl \f(^ + ^) + If^^ + « + !) + \f{J + v + 2)-f{j + v + l) 



P{Sn-l-2v-2 = j} > 0. 
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Thus if ^/(i + v) + |/(j + v + 2) > f{j + v + 1), e.g., /(a) is convex, we are done. Writing 
out the expression /(a) + /(a + 2) > 2/(a + 1) and cancehng common terms, it must be 
shown that 

(3.4) (m + n — 1 — a)(m + n — 2 — a) + (m — 1 — a)(m — 2 — o) > 2{m + n — 2 — a){m — l — a) 

for all < a < n — 1. Subtracting the right side from the left, the coefficients of a^ and a 
cancel, leaving n{n — 1) > 0. D 

3.3. Asymptotics for the 



\P-U\ 



max,, 



1 



P(w) 
U{w) 



P — U\\oo and separation distances. Recall the distances 
and sep(P) = max^ (l - ^^ j . 

Theorem 3.4. Consider the shelf shuffling measure Pm with n cards and m shelves. Sup- 
pose that m = crc'l'^ . Then, as n tends to infinity with < c < oo fixed, 

,l/(12c2) 



\Pm - U\ 

sep(Pm) 



oo ~ e^""-^ ' - 1, 

^_g-l/(24c2)_ 



Remark. We find it surprising that this many shelves are needed. For example, when n = 52, 
to make the distance less than 1/100, m = 1,085 shelves are required for \\Pm — U\\qo and 
m = 764 are required for sep(Pm)- 



Proof. Using Corollary |3.3[ the distance is achieved at the identity permutation or a per- 
mutation with [(n — l)/2j valleys. For the identity, consider n\Pm{id). Using Theorem 



(3.5) 



n-l 



,^ ,_, 2(n!) '■^ fm + n- a-l\ fn- 1 

n!P„(id) = 77^:::^ 2^ 



{2my 



a=0 



n 



To bound this sum, observe that ("~ )/2"~^ is the binomial probability density. To keep 
the bookkeeping simple, assume throughout that n is odd. The argument for even n is 
similar. 



For a = "^ — h j, the local central limit theorem as in 



Feller 



(1968, Chap. VII.2), shows 



(3.6) 



' n-l \ 



-2f/n 



2"-i y/7m72 

In the following, we show further that 



for j = o{n^^^) 



(3.7) 



n! ( m + 



m" 



n-l 
2 

n 



J 



r^ e 24c2 cv^ uniformly for j = o{n). 



Combining (3.6), (3.7), gives a Riemann sum for the integral 



e 24c^ 



V^ 



-2x2+x/c^^^gl/(12c2) 



the claimed result. This part of the argument follows Feller (1968, Chap. VII.2) and we 
suppress further details. To complete the argument the tails of the sum in (3.5) must be 
bounded. 



We first prove (3.7). From the definitions 



n\ fm — j + 
m" V n 



n-l 
2 



n-l 

2 

n 



n-l 
2 



J * 

— + — 

m m 



10 

using log(l 
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-X 



2 



(3.8) 



n-l 
2 

E 

n-l 
2 



log 1 



— + — 



+ 0(x3), 



E 



— + — 

ra m 



1 



E 



m 



2 + 12 



2 

1 n{n'^ — 1) 



— + — 



+ nO 



n Y 



1 



c^ 2c2n2 24c2 



m^ 



+ 



+ 



1 



The error term in (3.8) is uniform in j. For j = o{n), j /n = o(l) and (3.7) follows 



To bound the tails of the sum, first observe that (|3.8|) implies that ^["' ""^ ^ 



n! (m~j+- 



for all j. From Bernstein's inequality, if Xj = ±1 with probability 1/2, P{\Xi + - ■ •+X„_i| > 

a) < 2e~" /("~^). Using this, the sum over \j\ > An^'^ is negligible for A sufficiently large. 

The Gaussian approximation to the binomial works for j <^ ■n?''^. To bound the sum for 



IjI between n^'^ and n^'^, observe from (3.8) that in this range 



n! (m—j+- 



o 



Then, Feller (1968 p. 195) shows 



7,1/4 



' n— 1 \ 



with f{x) = Yj 



oo 
a=3 



2n-l 



^■nn/2 



-i(j)2/(n/4)-/(i/\A7i) 



a-2 



„4 „6 

Clt+C2^ + 



for explicit constants 



ci, C2, . . . . For Q\r?'^ < \j\ < 92n^'^, the sum under study is dominated by A Ylj>n^/3 e~^^ 
which tends to zero. 

The separation distance is achieved at permutations with ^^^ valleys (recall we are 



assuming that n is odd). From (3.1), 

1 



nlPn 



n 



n\ 



m" 



m + 



n-l 



n 



The result now follows from (3.7) with j = 0. 



D 



Remark. A similar argument allows asymptotic evaluation of total variation. We have not 
carried out the details. 

3.4. Distribution of cycle type. The number of fixed points and the number of cycles 
are classic descriptive statistics of a permutation. More generally, the number of i-cycles for 



1 < i < n has been intensively studied Shepp and Lloyd (1966); Diaconis et al. (1995). This 



section investigates the distribution of cycle type of a permutation w produced from a shelf 
shuffler with m shelves and n cards. Similar results for ordinary riffle shuffles appeared in 



Diaconis et al. (1995), and closely related results in the type B case (not in the language 
of shelf-shuffling) appear in Fulman (2001, 2002). Recall also that in the case of one shelf, 
the shelf shuffler generates one of the 2"~^ unimodal permutations uniformly at random. 
The cycle structure of unimodal permutations has been studied in several papers in the 
literature: see Fulman (2001, 2002); Thibon (20011) for algebraic/combinatorial approaches 
and Gannon (2001); Rogers (1981) for approaches using dynamical systems. 
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For what follows, we define 

k^ = ^ E M(d)(2m)^/'^ 

d\i 
d odd 

where ^ is the Moebius function of elementary number theory. 

Theorem 3.5. Let Pm{w) denote the probability that a shelf shuffler with m shelves produces 
a permutation w. Let Ni{w) denote the number of i-cycles of a permutation w in 5„. Then 

n>l weSn i>l «>1 



Proof. By the proof of Theorem 3.1, a permutation produced by a shelf shuffler with m 
shelves is equivalent to forgetting signs after the inverse of a type B riffle shuffle with 2m 
piles, then conjugating by the longest element n, n — 1, . . . , 1. Since a permutation and 
its inverse have the same cycle type and conjugation leaves cycle type invariant, the result 



follows from either (Fulman 2001, Thm. 7) or (Fulman, 2002, Thm. 9), both of which 
derived the generating function for cycle type after type B shuffles. D 

Theorem |3.5| leads to several corollaries. We say that a random variable X is Bino- 
mial(n,p) if F{X = j) = (")|>'(1 — p)^~^ , < j < n, and that X is negative binomial with 

parameters {f,p) if F{X = j) = (-^ -^^ )p'{^ — p)^ , < j < oo. 

Corollary 3.6. Let Ni(w) be the number of i-cycles of a permutation w. 

(1) Fix u such that < u < 1. Then choose a random number N of cards so that 
¥(N = n) = (1 — n)u". Let w be produced by a shelf shuffler with m shelves and N 
cards. Then any finite number of the random variables {Ni} are independent, and 

Ni is distributed as the convolution of a Binomiall fi^rn, ilT /T u ) '^^'^ ^ negative 

binomial with parameters {fi^m, {u/2mY). 

(2) Let w be produced by a shelf shuffler with m shelves and n cards. Then in the n — )• oo 
limit, any finite number of the random variables {Ni} are independent. The Ni are 

distributed as the convolution of a Binomiall fi,m, tt — vrrT ) o,i^d a negative binomial 

with parameters {fi^m, (l/2?7i)*). 



l + (u/2m)*^-^'' 



Proof. Setting all a;, = 1 in equation ( |3.9[ ) yields the equation 
(3.10) (i_^)-i = ]j('l 



i>l 



(n/2m)' 



Taking reciprocals of equation ( |3.10[ ) and multiplying by equation (3.9) gives the equality 
(3.11) (1 - n) + ^(1 - n)u- Yl ^-H n 

n>l w£Sn «>1 



^ni{w) 



_T-r fl + Xi{u/2my \ ^'■™ / 1 - {u/2my ^ ^''' 
~ J-J- V 1 + {u/2mf ) ' \1 - Xi{u/2m) 

This proves part 1 of the theorem, the first term on the right corresponding to the convo- 
lution of binomials, and the second term to the convolution of negative binomials. 

The second part follows from the claim that if a generating function f{u) has a Taylor 
series which converges at « = 1, then the n — >■ cx) limit of the coefficient of w" in /(u)/(l — 
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°° Q flnii" and observe that the 



u) is /(I). Indeed, write the Taylor expansion /(u) = Yl 

coefficient of u"' in f{u)/{l — u) is X]r=o^*- Now apply the claim to equation (3.11) with 

all but finitely many Xi equal to 1. D 

Remark. For example, when i = 1, /j,m = "t-! the number of fixed points are distributed 



as a sum of Binomial ( m 



1 



' 2m+l 



and negative binomial(?7z, 2^)- Each of these converges 
to Poisson(l/2) and so the number of fixed points is approximately Poisson(l). A similar 



analysis holds for the other cycle counts. Corollary 3.6 could also be proved by the method 



of moments, along the lines of the arguments of Diaconis et al. ( 1995 ) for the case of ordinary 
riffie shuffles. 

For the next result, recall that the limiting distribution of the large cycles of a uniformly 
chosen permutation in Sn has been determined by Goncharov Gontcharoff (1942); Gon 



charov (1944), Shepp and Lloyd (1966), Vershik and Shmidt (1977, 1978), and others. For 



instance the average length of the longest cycle Li is approximately .63n and Li/n has a 
known limiting distribution. The next result shows that even with a fixed number of shelves, 
the distribution of the large cycles approaches that of a uniform random permutation, as 
long as the number of cards is growing. We omit the proof, which goes exactly along the 



lines of the corresponding result for riffle shuffles in Diaconis et al. (1995). 



Corollary 3.7. Fix k and let Li{'w), L2{w), . . . , Lk{w) he the lengths of the k longest cycles 
of w G Sn produced by a shelf shuffler with m shelves. Then for m fixed, or growing with n, 
as n ^ oo, 



\Pm{Li/n <ti,... ,Lk/n < tn} 
uniformly in ti,t2, ■ ■ ■ ,tk. 



Poo{Li/n < ti,...,Lk/n < tn}\ 







As a final corollary, we note that Theorems 3.1 and 3.5 give the following generating 
function for the joint distribution of permutations by valleys and cycle type. Note that 
this gives the joint generating function for the distribution of permutations by peaks and 
cycle type, since conjugating by the permutation n, n — 1, . . . , 1 preserves the cycle type and 
swaps valleys and peaks. 

Corollary 3.8. Let v{w) denote the number of valleys of a permutation w. Then 



t 



+E 






1(1+*) 



n+l 



4t 



f(ui)+l 



n 



N,{w) 



i>l 






i>l 



XjU' 



n>l W(£Sn ^ ' \\ ' J 

The same result holds with v{w) replaced by p{w), the number of peaks ofw. 

Remark. There is a large literature on the joint distribution of permutations by cycles 



(2000b); Blessenohl et al. (2005); Poirier (1998) and by cycles and cyclic descents 



and descents Gessel and Reutenauer (1993); Diaconis et al. (1995); Reiner (1993); 



Fulman 



Fulman 



(2001, 2002 2000a), but Corollary 3.8 seems to be the first result on the joint distribution 



by cycles and peaks. 

3.5. Distribution of RSK shape. In this section we obtain the distribution of the Rob- 
inson-Schensted-Knuth (RSK) shape of a permutation w produced from a shelf shuffler 
with m shelves and n cards. For background on the RSK algorithm, see Stanley (1999). 



The RSK bijection associates to a permutation u; G S'n a pair of standard Young tableaux 
{P{w), Q{w)) of the same shape and size n. Q{w) is called the recording tableau of w. 

To state our main result, we use a symmetric function S\ studied in Stembridge ( 1997) (a 
special case of the extended Schur functions in Kerov and Vershik (1986)). One definition 
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of the S\ is as the determinant 

Sx{y) = det{qx,-i+j) 
where q-r = for r > and for r > 0, Qr is defined by setting 



13 



E*'"=ni 



i + mt 



n>0 «>1 

We also let fx denote the number of standard Young tableaux of shape A. 

Theorem 3.9. The probability that a shelf shuffler with m shelves and n cards produces a 
permutation with recording tableau T is equal to 

1 

for any T of shape X, where Sx has m variables. Thus the probability that w has RSK shape 
A is equal to 

fx^ ri 1 



2" ^ Im' 



;Sx 



m 



m 



Proof. By the proof of Theorem 3.1, a permutation produced by a shelf shuffler with m 



shelves is equivalent to forgetting signs after the inverse of a type B 2?Ti-shuffle, and then 
conjugating by the permutation n n — 1 . . .1. Since a permutation and its inverse have the 
same RSK shape (Stanley, 1999, Sect. 7.13), and conjugation by n,n — 1, . . . , 1 leaves the 
RSK shape unchanged ( Stanley] 1999 Thm. Al.2.10), the result follows from Theorem 8 
of Fulman (2002), which studied RSK shape after type B riffle shuffles. D 



3.6. Distribution of descents. A permutation w is said to have a descent at position i 
(1 < i < n — 1) if w{i) > w{i + 1). We let d{w) denote the total number of descents of tt. 
For example the permutation 3 15 4 2 has d{u]) = 3 and descent set 1, 3, 4. The purpose of 
this section is to derive a generating function for the number of descents in a permutation 
w produced by a shelf shuffler with m shelves and n cards. More precisely, we prove the 
following result. 

Theorem 3.10. Let Pm{w) denote the probability that a shelf shuffler with m shelves and 
n cards produces a permutation w. Letting [u^]f{u) denote the coefficient of u"" in a power 
series f{u), one has that 



(3.12) 



w£Sn 



[1-t) 



n+1 



it>i 



fcr^n- 



:km 



(1 + u/m 
(1 -u/mY"^' 



The proof uses the result about RSK shape mentioned in Section 3.5, and symmetric 



function theory; background on these topics can be found in the texts Stanley ( 1999 ) and 



Macdonald (1995) respectively. 



Proof. Let w he & permutation produced by a shelf shuffler with m shelves and n cards. 
The RSK correspondence associates to w a, pair of standard Young tableaux (P{w), Q{w)) 
of the same shape. Moreover, there is a notion of descent set for standard Young tableaux, 



and by Lemma 7.23.1 of Stanley (1999), the descent set of vu is equal to the descent set of 
Q{w). Let fx{r) denote the number of standard Young tableaux of shape A with r descents. 
Then Theorem |3.9| implies that 



F{d{w) 



|A|=n 



fxir) 



Sx 



1 



m 



1 



m 
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By equation 7.96 of [Stanley (1999), one has that 

r>0 k>l 



E /a Wi'+' = (1 -*)"+' E ^a(i, ..., i)t' 



where in the kth suniniand, s\{l, . . . , 1) denotes the Schur function with k variables spe- 
ciahzed to 1. Thus 



Endirv)=r).f-^ = Y^Y.^S>^ 



r>0 



r>0 \X\=n 

\n+l 



1 1 



t 



r+1 



— ^ — Yl *^' Yl ^^ 



m m 

1 1 



fc>l |A|=n 



?n,' ' m 



(l-t) 



n+l 



k>l n>0\X\=n 



sa(1,...,1 

1 1 



A 1 , • • • , 
m m 



5A(l,...,l)-n". 



/,Froni Appendix A. 4 of Stembridge (1997), if A ranges over all partitions of all natural 
numbers, then 



Y,sx{x)Sx{y)= n Y 



M>i 



1 + Xjyj 

'iUj 



Setting xi = ■ ■ ■ = Xk = u and yi = ■ ■ ■ = Vm = -^ completes the proof of the theorem. D 
For what follows we let An{t) = X]^g_5 ^d(w)+i ]^^ ^.j^g generating function of elements in 



Sn by descents. This is known as the Eulerian polynomial and from page 245 of Comtet 
( |1974[ ), one has that 



(3.13) 



^„(t) = (l-t)"+i^t^A;". 



k>l 



This also follows by letting tti — t- cxd in equation (3.12). 

The following corollary derives the mean and variance of the number of descents of a 
permutation produced by a shelf shuffler. 

Corollary 3.11. Let w he a permutation produced by a shelf shuffler with m shelves and 
n > 2 cards. 

(1) The expected value of d{w) is ^^. 

(2) The variance of d{w) is ^^ + |^. 

Proof. The first step is to expand [n"] L |^^™L^ as a series in k. One calculates that 
„, (1 + u/m) 1 v^ fkm\ fkm + n — a — 1" 



(1 — u/m)^^ rn^ ^^-^ \ a 

^ ' ' a>0 



m 

1 

n\ 






a>0 



n — a 
{km) . . . (km — a + 1) 



(km + n — a — 1) . . . (km) 



{n — a)l 



2"n(n-l)(n-2) 2 , 
^ " + 12m^ " + 
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where the ... in the last equation denote terms of lower order in k. Thus Theorem 3.10 
gives 



Y,Pm{w)f 



»+l 



(1 - ty 



n+l 



n! 



k>l 



n-2, ,n 



(1-t) 



n-1 



(n-2)l 



^t^k''- 



fc>i 



+ {l-tfC{t) 



where C{t) is a polynomial in t. By equation (3.13), it follows that 

+ {l-tfC{t). 



^p„Ht'^(-)+i = ^ + (i-t)2™"^^"-'(*^ ' '^ ^^3, 



n\ 



I2m? (n-2)! 



Since the number of descents of a random permutation has mean (n — l)/2 and variance 



(n + 1)/12 for n > 2, it follows that 



KW _ {n+l) 



Thus 



y^^Pm{w)d{w) 



and also that 



K{^) 



(3n2 + n-2)/12. 



n — 1 



and 



Y,P^iw)d{w)[d{w) + l] 



3n^ + n - 2 n-2 



12 



Qm? 



and the result follows. D 

Remarks. 

• Part 1 of Corollary |3.11 can be proved without generating functions simply by noting 
that by the way the shelf shuffler works, w and its reversal are equally likely to be 
produced. 



• Theorem 3.10 has an analog for ordinary riffle shuffles which is useful in the study 



of carries in addition. See Diaconis and Pulman (2009a) for details. 



4. Iterated shuffling 



This section shows how to analyze repeated shuffles. Section 4.1 shows how to combine 



shuffles. Section 4.2 gives a clean bound for the separation distance. 



4.1. Combining shuffles. To describe what happens to various combinations of shuffles, 
we need the notion of a signed m— shuffle. This has the following geometric description: 
divide the unit interval into sub-intervals of length — ; each sub-interval contains the graph 
of a straight line of slope ztni. The left-to-right pattern of signs is is indicated by a vector 
X of length m. Thus if m = 4 and x = + + ++, an x— shuffle is generated as shown on the 
left side of Figure 2 If m, = 4 and x = -\ h, the graph becomes that of the right side 



of Figure 2 Call this function /^ . 

The shuffle proceeds as in the figure with n points dropped at random into the unit 
interval, labeled left to right, 2/1,2/2, ■ • • , 2/n and then permuted by fx- In each case there is 
a simple forward description: the deck is cut into m piles by a multinomial distribution and 
piles corresponding to negative coordinates are reversed. Finally, all packets are shuffled 
together by the GSR procedure. Call the associated measure on permutations Px- 

Remark. Thus, ordinary riffle shuffles are ++ shuffles. The shelf shuffle with 10 shelves is 
an inverse ^ 1 — • • • H — (length 20) shuffle in this notation. 
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Figure 2. Left: Four peaks; right: m = 4, x 



The following theorem reduces repeated shuffles to a single shuffle. To state it, one piece 
of notation is needed. Let x = (xi,X2, • . . ,Xa) and y = {yi,y2, ■ ■ ■ ,yb) be two sequences 
of ± signs. Define a sequence of length ab as x * y = y^^ , y^^ , • • • , 2/^" with (yi , . . . , y^) ^ = 
(yi, . . . , yt) and (yi, . . . , yb)~^ = {—yt, —yb-i, ■ ■ ■ , —yi)- This is an associative product on 
strings; it is not commutative. Let Px be the measure induced on Sn (forward shuffles). 

Example. 

(+ + +)* (++) = + + + + ++ 

(+-)*(+-) = + -+- 
(+-) * (+ + -+) = + + - + -H 

Theorem 4.1. If x and y are ±1 sequences of length a and h, respectively, then 



Proof. This follows most easily from the geometric description underlying Figure 1 and 



Figure 2 If a uniformly chosen point in [0, 1] is expressed base a, the "digits" are uniform 
and independently distributed in {0, 1, . . . , a — 1}. Because of this, iterating the maps on 
the same uniform points gives the convolution. The iterated maps have the claimed pattern 
of slopes by a simple geometric argument. D 

Corollary 4.2. The convolution of k -\ — shuffles is a -\ 1 — •••H — (2 terms) shuffle. 

Further, the convolution of a shelf shuffler with nii and then m,2 shelves is the same as a 
shelf shuffler with 2mini2 shelves. 

4.2. Bounds for separation distance. The following theorem gives a bound for separa- 
tion (and so for total variation) for a general Px shuffle on Sn- 

Theorem 4.3. For any ±1 sequence x of length a, with Px the associated measure on Sn, 



and sep{Px) from (2.1), 
(4.1) 



n-l 



sep(p,) < 1 - n ^ 



Proof. It is easiest to argue using shuffles as in Description 1 There, the backs of cards are 
labeled, independently and uniformly, with symbols 1, 2, . . . , a. For the inverse shuffle, all 
cards labeled 1 are removed, keeping them in their same relative order, and placed on top 
followed by the cards labeled 2 (placed under the Is) and so on, with the following proviso: 
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if the ith coordinate of x is —1, the cards labeled i have their order reversed; so if they are 
1, 5, 17 from top down, they are placed in order 17, 5, 1. All of this results in a single 
permutation drawn from P^. Repeated shuffles are modeled by labeling each card with a 
vector of symbols. The fcth shuffle is determined by the kth coordinate of this vector. The 
first time t that the first k coordinates of those n vectors are all distinct forms a strong 



stationary time. See Aldous and Diaconis (1986) or Fulman (19981) for further details. The 
usual bound for separation yields 



sep{Px) < -Pjall n labels are distinct}. 



The bound (4.1) now follows from the classical birthday problem. 



Remarks. 



D 



n(n — l) 
2a , 



For a large with respect to n, the right side is well-approximated by 1 - 

This is small when n^ <^ a. 

The theorem gives a clean upper bound on the distance to uniformity. For example, 

when n = 52, after 8 ordinary riffle shuffles (so x = + + ••• + +, length 256), the 

bound (4.1) is sep{Px) < 0.997, in agreement with Table 1 of Assaf et al. (2011). 

For the actual shelf shuffle with x = -\ ! — ■ ■ ■ -\ — (length 20), the bound gives 

sep(P^) = 1 but sep(P^ * P^) < 0.969 and sep^P^ * Px * Px) < 0.153. 

The bound in Theorem |4.3| is simpl e and genera l. However, it is not sharp for the 

original shelf shuffler. The results of Section 3.3 sho w th at m = cn^''^ shelves sufflce 

to make sep(Pm) small when c is large. Theorem 4.3 shows that m = en? steps 

sufflce. 



5. Practical tests and conclusions 

The engineers and executives who consulted us found it hard to understand the total 
variation distance. They asked for more down-to-earth notions of discrepancy. This section 
reports some ad hoc tests which convinced them that the machine had to be used differently. 



Section 5.1 describes the number of cards guessed correctly. Section 5.2 briefly describes 
three other tests. [Section 5.31 describes conclusions and recommendations. 

5.1. Card guessing with feedback. Suppose, after a shuffle, cards are dealt face- up, one 
at a time, onto the table. Before each card is shown, a guess is made at the value of the 
card. Let Xi, 1 < i < n, he one or zero as the ith guess is correct and T„ = Xi + •••-!- Xn 
the total number of correct guesses. If the cards were perfectly mixed, the chance that 
Xi = 1 is 1/n, the chance that X2 = 1 is l/(n — 1), . . . , that Xi = 1 is l/(n — i + 1). 
Further, the Xi are independent. Thus elementary arguments give the following. 

Proposition 5.1. Under the uniform distribution, the number of cards guessed correctly 
Tn satisfies 

• E{Tn) = ^ + ^H hl~logn + 7-FO(^) with 7 = 0.577 Euler's constant. 



var(r„) 



1 



1 



1 



1 
n-1 



+ 



1 



log n + 7 



+ 



Normalized by its mean and variance, T„ has an approximate normal distribution. 



When n = 52, T„ has mean approximately 4.5, standard deviation approximately 
and the number of correct guesses is between 2.7 and 6.3, 70% of the time. 



Based on the theory developed in [Section 3| we constructed a guessing strategy 
jectured to be optimal — for use after a shelf shuffle. 



^2.9, 



con- 



Strategy 



To begin, guess card 1. 
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If guess is correct, remove card 1 from the list of available cards. Then guess card 

2, card 3, .... 

If guess is incorrect and card i is shown, remove card i from the list of available 

cards and guess card i + 1, card i + 2, .... 

Continue until a descent is observed (order reversal with the value of the current 

card smaller than the value of the previously seen card) . Then change the guessing 

strategy to guess the next-smallest available card. 

Continue until an ascent is observed, then guess the next-largest available card, and 

so on. 

Table 2. Mean and variance for n — 52 after a shelf shuffle with m shelves under 
the conjectured optimal strategy. 



m 


1 


2 


4 


10 


20 


64 


mean 
variance 


39 
3.2 


27 
5.6 


17.6 
6.0 


9.3 

4.7 


6.2 

3.8 


4.7 
3.1 



A Monte Carlo experiment was run to determine the distribution of T„ for n = 52 with 



various values of m (10,000 runs for each value). Table 2 shows the mean and variance for 
various numbers of shelves. Thus for the actual shuffler, m = 10 gives about 9.3 correct 
guesses versus 4.5 for a well-shuffled deck. A closely related study of optimal strategy for 



the GSR measure (without feedback) is carried out by Ciucu (1998). 



5.2. Three other tests. For the shelf shuffler with m shelves, an easy argument shows 
that the chance that the original top card is still on top is at least l/2m instead of 1/n. 
When n = 52, this is 1/20 versus 1/52. The chance that card 2 is on top is approximately 
^ ~ 2m) '^tiils the chance that card 2 is second from the top is roughly tttitw. The 



1 

2m 



(2my 



same probabilities hold for the bottom cards. While not as striking as the guessing test of 
Section 5.1, this still suggests that the machine is "off." 



Our second test supposed that the deck was originally arranged with all the red cards on 
top and all the black cards at the bottom. The test statistic is the number of changes of 
color going through the shuffled deck. Under uniformity, simulations show this has mean 
26 and standard deviation 3.6. With a 10-shelf machine, simulations showed 17 it 1.83, a 
noticeable deviation. 

The third test is based on the spacings between cards originally near the top of the 
deck. Let Wj denote the position of the card originally at position j from the top. Let 
Dj = \wj — Wj-^-i\. Figure 3 shows a histogram of Dj for 1 < j < 9, from a simulation with 



n = 52 based on a 10-shelf shuffler. Figure 4 shows histograms for the same statistics for a 
well-shuffled deck; there are striking discrepancies. 



5.3. Conclusions and recommendations. The study above shows that a single iteration 
of a 10-shelf shuffler is not sufflciently random. The president of the company responded 
"We are not pleased with your conclusions, but we believe them and that's what we hired 
you for." 

We suggested a simple alternative: use the machine twice. This results in a shuffle 
equivalent to a 200-shelf machine. Our mathematical analysis and further tests, not reported 



here, show that this is adequately random. Indeed, Table 1 shows, for total variation, this 
is equivalent to 8-to-9 ordinary riffle shuffles. 
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Figure 3. 9 spacings from a 10-shelf shuffle; j varies from top left to bottom right, 
1 < j < 9. 
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1 < j < 9. 
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